Arabic Nested Noun Compound Extraction Based on Linguistic Features and Statistical Measures

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collocational Translation Memory Extraction Based on Statistical and Linguistic Information

In this paper, we propose a new method for extracting bilingual collocations from a parallel corpus to provide phrasal translation memories. The method integrates statistical and linguistic information to achieve effective extraction of bilingual collocations. The linguistic information includes parts of speech, chunks, and clauses. The method involves first obtaining an extended list of Englis...

متن کامل

Automatic Arabic Text Summarization System Based on Semantic Features Extraction

Recently, one of the problems arisen due to the amount of information and it’s availability on the web, is the increased need for effective and powerful tool to automatically summarize text. For English and European languages an intensive works have been done with high performance and nowadays they look forward to multi-document and multi-language summarization. However, Arabic language still s...

متن کامل

Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic

Statistical studies based on automatic phonetic transcription of Standard Arabic texts are rare, and even though studies have been performed, they have been done only on one level – phoneme or syllable – and the results cannot be generalized on the language as a whole. In this paper we automatically derived accurate statistical information about phonemes, allophones, syllables, and allosyllable...

متن کامل

Influence of accurate compound noun splitting on bilingual vocabulary extraction

The influence of compound noun splitting on a German-Polish bilingual vocabulary extraction task is investigated. To accomplish this, several unsupervised methods for increasingly accurate compound noun splitting are introduced. Bilingual evidence from a parallel German-Polish corpus and co-occurrence counts from the web are used to disambiguate compound noun analyses directly. These collected ...

متن کامل

Atrial Activity Extraction Based on Statistical and Spectral Features

Atrial fibrillation is the most common human arrhythmia. The analysis of the associated atrial activity provides features of clinical relevance. Previously, the extraction of the atrial signal is necessary. We follow the semi Blind Source Extraction S-BSE approach to solve the problem. The proposed algorithm satisfies the prior knowledge about the atrial signal: its statistical properties and i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: GEMA Online® Journal of Language Studies

سال: 2018

ISSN: 1675-8021,2550-2131

DOI: 10.17576/gema-2018-1802-07